A Hardware Algorithm for The Exact Subsequence Matching Problem in DNA Strings

Authors

  • Octavian CREŢ
  • Zsolt MATHE
  • Paul CIOBANU
  • Sonia MĂRGINEAN
  • Adrian DĂRĂBANT
Abstract

This paper introduces an algorithm for DNA string detection and proposes an efficient hardware implementation of it on FPGA devices. Its main intended application is the detection of intron and exon strings in DNA chains, but its applicability is not limited to genetics. The algorithm, GenDiv, is based on dynamic programming. In software, the algorithm's complexity is O(m · n), where m and n are the lengths of the two DNA strings being processed. In hardware, after a few adaptations presented in the paper, the algorithm can be implemented as a systolic array and its running time becomes linear, O(m + n). The circuitry resources required in the FPGA chip also scale linearly. Simulations and tests show that the hardware implementation is several orders of magnitude faster than the software one.
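
The abstract does not give the GenDiv recurrence, so the following Python sketch uses the classical longest common subsequence (LCS) table purely as a stand-in to illustrate the complexity argument. It is an assumption for illustration, not the authors' algorithm, and the function names are hypothetical. Each cell depends only on cells of earlier anti-diagonals, so all cells on one anti-diagonal could be computed at the same time by a row of processing elements. A sequential program still visits m · n cells, but the number of wavefront steps is only m + n - 1.

```python
def lcs_length_rowwise(a: str, b: str) -> int:
    """Classical software evaluation: visits all m * n cells row by row."""
    m, n = len(a), len(b)
    table = [[0] * (n + 1) for _ in range(m + 1)]
    for i in range(1, m + 1):
        for j in range(1, n + 1):
            if a[i - 1] == b[j - 1]:
                table[i][j] = table[i - 1][j - 1] + 1
            else:
                table[i][j] = max(table[i - 1][j], table[i][j - 1])
    return table[m][n]


def lcs_length_wavefront(a: str, b: str) -> tuple[int, int]:
    """Same table, filled one anti-diagonal (i + j = d) at a time.

    Returns (lcs_length, wavefront_steps). The cells inside one step are
    mutually independent, which is what a systolic array would exploit;
    here they are simply evaluated in a loop.
    """
    m, n = len(a), len(b)
    table = [[0] * (n + 1) for _ in range(m + 1)]
    steps = 0
    for d in range(2, m + n + 1):
        first_i = max(1, d - n)
        last_i = min(m, d - 1)
        if first_i > last_i:
            continue  # only happens when one of the strings is empty
        for i in range(first_i, last_i + 1):
            j = d - i
            if a[i - 1] == b[j - 1]:
                table[i][j] = table[i - 1][j - 1] + 1
            else:
                table[i][j] = max(table[i - 1][j], table[i][j - 1])
        steps += 1
    return table[m][n], steps


if __name__ == "__main__":
    length, steps = lcs_length_wavefront("ACGTAC", "GTTAC")
    print(length, steps)
```

For two strings of lengths 6 and 5, the wavefront version takes 10 steps where the row-wise version performs 30 cell updates one after another; a hardware design would additionally need one processing element per cell of the longest anti-diagonal, which is consistent with the linear resource growth the abstract mentions.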

Similar resources

A Comparative Study of Different Longest Common Subsequence Algorithms

The longest common subsequence is a classical problem which is solved by using the dynamic programming approach. The LCS problem has an optimal substructure: the problem can be broken down into smaller, simple "subproblems", which can be broken down into yet simpler subproblems, and so on, until, finally, the solution becomes trivial. The LCS problem also has overlapping subproblems: the soluti...

On the Longest Common Parameterized Subsequence

The well-known problem of the longest common subsequence (LCS), of two strings of lengths n and m respectively, is O(nm)-time solvable and is a classical distance measure for strings. Another well-studied string comparison measure is that of parameterized matching, where two equal-length strings are a parameterized-match if there exists a bijection on the alphabets such that one string matches ...

Window-Accumulated Subsequence Matching Problem

Given two strings, text t of length n and pattern p = p1 ... pk of length k, and given a natural number w, the subsequence matching problem consists in finding the number of size-w windows of text t which contain pattern p as a subsequence, i.e. the letters p1, ..., pk occur in the window, in the same order as in p, but not necessarily consecutively (they may be interleaved with other lett...
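
The problem statement above can be made concrete with a small Python sketch. This is a naive window-by-window check written only to illustrate the definition; it is not the method of the listed paper, and the function names are hypothetical.

```python
def is_subsequence(pattern: str, window: str) -> bool:
    """True if the letters of `pattern` occur in `window` in order,
    not necessarily consecutively."""
    remaining = iter(window)
    # `ch in remaining` consumes the iterator up to the first match,
    # so later letters must be found after earlier ones.
    return all(ch in remaining for ch in pattern)


def count_windows(text: str, pattern: str, w: int) -> int:
    """Count the size-w windows of `text` that contain `pattern`
    as a subsequence (naive scan, roughly O(n * w) time)."""
    if w <= 0 or w > len(text):
        return 0
    return sum(
        is_subsequence(pattern, text[start:start + w])
        for start in range(len(text) - w + 1)
    )


if __name__ == "__main__":
    print(count_windows("abcabc", "ac", 3))
```

With text "abcabc", pattern "ac" and w = 3, the windows are "abc", "bca", "cab" and "abc"; only the two "abc" windows contain "a" followed later by "c".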

Solving the Longest Common Subsequence (LCS) Problem using the Associative ASC Processor with Reconfigurable 2D Mesh

Virdi Sabegh Singh, Hong Wang, Robert A. Walker (Computer Science Department, Kent State University, Kent, OH 44242, USA). Abstract: As new genes are sequenced, it is necessary for molecular biologists to compare the new gene’s biosequence to known sequences. One simple form of DNA seq...

Two Algorithms for LCS Consecutive Suffix Alignment

The problem of aligning two sequences A and B to determine their similarity is one of the fundamental problems in pattern matching. A challenging, basic variation of the sequence similarity problem is the incremental string comparison problem, denoted Consecutive Suffix Alignment, which is, given two strings A and B, to compute the alignment solution of each suffix of A versus B. Here, we prese...

Publication date: 2009